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ABSTRACT 

Using the Gemini Near-InfraRed Spectrograph (GNIRS), we have completed a near- infrared spec- 
troscopic survey for if-bright galaxies at z ~ 2.3, selected from the MUSYC survey. We derived 
spectroscopic redshifts from emission lines or from continuum features and shapes for all 36 observed 
galaxies. The continuum redshifts are driven by the Balmer/4000A break, and have an uncertainty 
in Az/(1 + z) of < 0.019. We use this unique sample to determine, for the first time, how accurately 
redshifts and other properties of massive high-redshift galaxies can be determined from broadband 
photometric data alone. We find that the photometric redshifts of the galaxies in our sample have a 
systematic error of 0.08 and a random error of 0.13 in Az/ (1 + z). The systematic error can be reduced 
by using optimal templates and deep photometry; the random error, however, will be hard to reduce 
below 5%. The spectra lead to significantly improved constraints for stellar population parameters. 
For most quantities this improvement is about equally driven by the higher spectral resolution and 
by the much reduced redshift uncertainty. Properties such as the age, Ay, current star formation 
rate, and the star formation history are generally very poorly constrained with broadband data alone. 
Interestingly stellar masses and mass-to-light ratios are among the most stable parameters from broad- 
band data. Nevertheless, photometric studies may overestimate the number of massive galaxies at 
2 < z < 3, and thus underestimate the evolution of the stellar mass density. Finally, the spectroscopy 
supports our previous finding that red galaxies dominate the high-mass end of the galaxy population 
at z — 2 — 3. 

Subject headings: galaxies: high redshift — galaxies: distances and redshifts — galaxies: evolution 



1. INTRODUCTION 

The extensive use of photometric redshifts 
greatly enhanced our knowledge of the z 
3 universe. While col or criteria, s u ch as the 
man break technique (|Steidel et al.l lL996alfbh . 
tant red galaxy select ion (DRGs, IFranx et alj 120031: 
van Dokkum et al"1l2003l ) and BzK selection (jDaddi et al.l 
2004 ) provide an easy identification of high-redshift 
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galaxies, photometric redshifts allow the study of 
apparently unbiased samples. Our current under- 
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galaxy clustering (e.g . 



l2007at iFoucaud et alj 20071 ) essentially all rely on photo- 



metric redshifts. 

Ideally, all these studies would have been based on 
spectroscopic redshifts. However, obtaining spectro- 
scopic redshifts for the required samples is hampered 
by several obstacles. Due to their faintness, obtaining 
redshifts of high-redshift galaxies requires long integra- 
tions on 8-10m class telescopes. The largest samples of 
spectroscopically con firmed high redshif t galaxies num- 
ber in the 1000s (e.g.. ISteidel et al1l2003f) . several orders 
of magnitude short of the largest photometric samples. 
Furthermore, and more fundamentally, current spectro- 
scopic samples are strongly biased towards blue, star 
forming galaxies which are bright in the observer's opti- 
cal (Lyman break galaxies, or LBGs). However, it has 
become clear that the typical massive galaxy at this red- 
shift range is red in the rest-frame optical and faint in 
the rest- frame ultra-violet (UV), and blue LBGs con- 
stitute only ~20% of m assive galaxies at z = 2 — 3 
(jvan Dokkum et al.ll20"06h . 

Obtaining spectroscopic redshifts for typical mas- 
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TABLE 1 

Sample and Observations 



Exptime 



ID 


RA 


DEC 


K s 


R 


J - K 


observation dates 


min 


IDKR06B+07 a 


1030-32 


10 30 41.5 


5 19 55 


19.68 


26.83 


2.62 


2006/12/19 + 2007/03/12 


230 




i no A 1 r\ i 

1030-101 


■in on ~t r\ -i 

10 30 10.1 


5 20 11 


18.97 


25.63 


2.15 


nnnr 1 /no /no 

2006/02/23 


120 




1030-301 


■in on p" n o 

10 30 50.8 


5 20 49 


18.82 


25.49 


2.81 


nnnr 1 /ni /nn 

2006/01/20 


90 




1030-609 


10 30 49.6 


5 21 50 


19.68 


24.23 


2.02 


2006/02/24 


130 




1030-807 


10 30 20.0 


5 22 33 


19.72 


24.77 


2.40 


2006/02/23 


120 




1030-1531 


10 30 38.9 


5 24 52 


19.38 


22.92 


2.23 


2006/02/25 


80 




1030-1611 


■in on a a a 

10 30 48.4 


5 25 03 


19.58 


25.55 


2.37 


nnnr 1 /nn /n a 

2006/02/ 24 


120 




1030-1813 


10 30 51.2 


5 25 36 


19.01 


24.97 


2.93 


2006/01/20 


80 




1030-1839 


10 30 45.4 


5 30 07 


19.61 


24.20 


2.35 


2006/12/16 


80 




1030-2026 


"i n on on t 

10 30 22.7 


5 28 26 


19.48 


25.22 


2.93 


nnnr 1 /nn /no 

2006/02/22 


120 




1030-2329 


10 30 16.2 


5 27 32 


19.72 


25.24 


2.47 


2006/02/25 


120 




1030-2559 


10 30 40.1 


5 26 34 


19.62 


25.89 


2.52 


2006/02/22 


110 


- 


1030-2728 b 


10 30 18.4 


5 26 05 


19.52 


25.09 


2.69 


2006/01/21 


120 


- 


1030-2927 


10 30 43.3 


5 29 34 


19.48 


24.52 


2.23 


2006/12/18 + 2007/03/13 


230 


- 


1256-0 


12 54 59.6 


1 11 30 


19.26 


24.98 


2.26 


2005/05/19+27+30 + 2006/02/24 


305 


151 


1256-142 


12 55 02.7 


1 07 32 


19.45 


25.99 


2.54 


2006/02/23 


120 


465 


1256-519 


12 55 08.4 


1 06 14 


18.99 


25.51 


2.55 


2006/02/25 


80 


- 


1256-1207 


12 55 19.7 


1 12 46 


19.25 


25.34 


2.04 


2006/02/25 


80 


- 


1256-1967 


12 55 25.8 


1 03 25 


18.71 


23.55 


2.02 


2005/05/18 + 2006/01/18 


240 


2889 


HDFS1-259 


22 33 11.2 


-60 40 47 


19.42 


24.00 


2.11 


2006/12/17-18 


140 


- 


HDFS1-1849 


22 33 37.9 


-60 33 15 


19.30 


25.18 


2.51 


2004/09/06 


115 


- 


HDFS2-509 


22 31 23.1 


-60 39 08 


18.57 


23.20 


2.50 


2005/05/16+19 


235 


- 


HDFS2-1099 


22 32 03.2 


-60 36 13 


19.26 


24.94 


2.55 


2006/12/19 


120 




HDFS2-2046 


22 32 30.8 


-60 32 44 


19.38 


24.24 


2.21 


2005/05/20 


125 


- 


ECDFS-4454 


3 32 11.5 


-27 55 23 


19.24 


24.28 


2.89 


2006/01/18 


100 


3662 


ECDFS-4511 


3 32 43.2 


-27 55 15 


18.77 


23.36 


2.62 


2006/01/21 + 20006/02/25 


190 


3694 


ECDFS-4713 


3 31 52.5 


-27 54 48 


18.68 


22.94 


2.13 


2006/02/22 


60 


3896 


ECDFS-5856 


3 32 13.3 


-27 52 26 


19.42 


25.42 


3.21 


2006/01/19 


120 


4937 


ECDFS-6842 


3 31 51.3 


-27 50 56 


19.09 


24.47 


2.47 


2006/12/19 + 2007/03/11+14 


210 




ECDFS-6956 


3 32 02.5 


-27 50 46 


19.16 


23.40 


2.43 


2006/01/20 + 2006/02/24 


150 


5754 


ECDFS-9822 


3 31 33.9 


-27 46 03 


19.14 


24.13 


3.04 


2006/12/17 


120 




ECDFS-11490 


3 32 45.0 


-27 43 09 


19.25 


24.01 


2.49 


2006/01/20+21 


190 


9510 


ECDFS-12514 


3 31 39.5 


-27 41 20 


19.11 


22.79 


1.72 


2006/02/23 


90 


10525 


ECDFS-13532 


3 31 54.8 


-27 39 23 


19.52 


24.98 


3.51 


2006/12/18 


160 




ECDFS-16671 


3 31 58.9 


-27 35 16 


18.99 


22.08 


1.74 


2006/12/16 


60 




CDFS-6202 


3 32 31.5 


-27 46 23 


19.04 


23.62 


2.28 


2004/09/02+03 


90 


6036 



a ID numbers in Krick ct al. (2006b, 2007)* 3 The spectroscopic redshift of this galaxy has first been confirmed using X-band spectroscopy 
with NIRSPEC on Keck, in 2005 January. 



sive galaxies requires deep spectroscopy at near-infrared 
(NIR) wavelengths. Unfortunately, NIR observations are 
complicated by the combination of the high sky bright- 
ness, numerous bright and variable night sky lines and 
strong atmospheric absorption bands, and the limited 
field of view of current and planned NIR spectrographs. 
Thus, obtaining spectroscopic rcdshifts for hundreds or 
thousands of galaxies with K ~ 21 (the typical bright- 
ness of galaxies with M > 10 11 M© at z ~ 2.5) will 
not be feasible in the foreseeable future. Until the next 
generation of space missions and > 20 m ground-based 
telescopes we remain largely dependent on photometric 
redshifts for studies of large and faint galaxy samples 
beyond z > 1.5. 

Our provisional dependency on broadband photomet- 
ric studies requires a more accurate calibration and un- 
derstanding of the involved systematics. The current 
spectroscopic samples used for calibration of photomet- 
ric high-redshift studies are based primarily on optical 
spectroscopy. As these samples are biased towards un- 
obscured star-forming galaxies, their calibration may not 
be representative for the total sample of massive galax- 
ies. Photometric properties of red, massive galaxies at 
high redshift are poorly calibrated, and since red galax- 
ies dominate the high mass end at 2 < z < 3, systematics 
may have large effects on the final results. 



NIR spectroscopy on a substantial, unbiased sample of 
massive, high-redshift galaxies is needed to test our pho- 
tometric studies, and obtain insights regarding possible 
systematic effe cts. The Gemini Near-InfraRed Spectro- 
graph (GNIRS. lEhas et al.ll2006h is especially well-suited 
for spectroscopy of z ~ 2.3 galaxies, due to the large 
wavelength coverage offered by the cross-disperser (0.9- 
2.5 /im). This coverage offers two advantages: there is a 
large probability of finding emission lines, and it allows 
the characterization of the continuum emission, which 
is partic ularly impor t ant for galaxies without detected 
lines. In iKriek et alJ (|2006ri) we found that a substan- 
tial fraction of the massive galaxies at z ~ 2.3 have no 
detected emission lines. For these galaxies we derived 
spectroscopic redshifts by modeling the st ellar contin- 
uum, driven by the Balmer/4000 A break (jKriek et al.l 
l2006al) . 

In this paper we present our full survey of X-selected 
galaxies at z ~ 2.3, conducted with GNIRS on Gemini- 
South between September 2004 and March 2007. In to- 
tal we integrated more than ~80 hours, divided over 6 
observing runs, on a sample of 36 galaxies. In previ- 
ous papers based on preliminary re sults of this survey 
we discussed the stellar populatio ns (IKriek et al.ll2 006b) 
and the origin of the line emission (IKriek et al.ll2007M . In 
this paper we will give an overview of the total survey 
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Fig. 1. — Comparison between the photometric properties of the 
GNIRS sample at 2 < Zphot < 3 and a mass-limited sample (M > 
IO^A/q) at 2 < 2 phot < 3. The probabilities (P) that the GNIRS 
sample and the full mass-selected sample have similar distributions, 
as derived using a Mann- Whitney (MW) and a Kolmorov-Smirnov 
(KS) test, are given in the panels. Additionally, we divide the mass- 
selected sample into its i^-bright (K < 19.7) and A"-faint (K > 
19.7) members. The GNIRS sample may be less representative for 
a Jf-bright sample, as the redshift distribution is different. 

(§[2]), compare photometric and spectroscopic redshifts 
(§[3]) and stellar populations properties (§E]), and discuss 
the implications for photometric studies (§HJ. 

Throughout the paper we assume a ACDM cosmology 
with ft m = 0.3, tt A = 0.7, and H a = 70 km s" 1 Mpc" 1 . 
The broadband magnitudes are given in the Vega-based 
photometric system unless stated otherwise. Further- 
more, we will measure the scatter and the offset between 
various properties using the normalized biweight mean 
absolute deviation and the biwe ight mean, respectively 
(jBeers. Flvnn fc Gebhardtlll990h . As biweight statistics 
are less sensitive towards outliers than the normal mean, 
and more efficient than the median, they are most ap- 
propriate for the small sample sizes in this work. 

2. DATA 

2.1. Sample Selection 

The galaxies studied in this paper are selected from 
the MUlti- waveleng t h Survey by Yale- C hile (MUSYC, 
iGawiser et all 120061 : lOuadri et all l2007bD . This survey 
consist of optical imaging (UBVRIz) of four 30' x 30' 
fields, shallow NIR imaging (JHK) over the same area, 
and deeper NIR imaging over four 10' x 10' fields. The 
depth of the deep and wide NIR photometry is K ~ 21 
and K ~ 20 (5a) respectively. The spectroscopic follow- 
up presented in this paper is sel ected from the dee p 
fields HDF-South, 1030, and 1256 (jQuadri et al.ll2007br i. 
and the shallow extended Chandra Deep Field South 
(ECDFS, E.N. Taylor et al. 2007, in preparation). One 
galaxy is selected from th e Great Observatorie s Ori- 
gins Deep Survey (GOODS: iGiavalisco et al1l200l . The 
optical-to-NIR photometry that we used as part of this 
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Fig. 2. — Comparison of J — K and R — K colors as a function of 
^phot between a photometric mass-limited (M > 10 11 A/q) sample 
and our spectroscopic sample. The gray diamonds and dots rep- 
resent all massive galaxies in the deep MUSYC fields (SDSS1030, 
1256 and HDF-S) with K < 19.7 and K > 19.7 respectively. The 
black symbols represent the 36 galaxies of the GNIRS sample, se- 
lected from both the MUSYC deep {diamonds) and wide (ECDFS, 
crosses) surveys. 

work is described by S. Wuyts et al. (2007, in prepara- 
tion) . 

We selected galaxies with 2.0 < z p hot ^ 2.7 (see § I3.2|) 
and K < 19.7. This redshift interval is chosen as the 
bright emission lines H/3, [Om], Ha, and [Nil] fall in 
the H and K atmospheric windows. A few galaxies had 
2-7 < Zphot % 3.0 at the time of selection, but were 
observed because a large part of their confidence inter- 
vals fell in the targeted redshift range, or because the 
galax y had a known z SD e C between 2.0 and 2.7 ( CDFS- 
6202. 1van Dokkum et alJl2005t iDaddi et al.ll2005h . Due 
to catalog updates of the ECDFS field, the final z p hot of 
several more galaxies scattered out of the selected red- 
shift range. The fact that the ECDFS catalog was still 
in a preliminary stage at the time of selection may com- 
plicate some of the interpretation in this work. For this 
reason and the fact that the ECDFS is the only field with 
shallow NIR photometry, the analysis in this paper will 
also focus on the subsample excluding the ECDFS. 

In total we obtained usable NIR spectra for a sample 
of 36 galaxies. For ~4 additional galaxies we obtained 
empty spectra due to mis-alignment or extremely bad 
weather conditions. 

It is important to establish whether our sample is rep- 
resentative for the galaxy population at z ~ 2.5. In Fig- 
ure [T] we compare the distributions of J — K, R — K, rest- 
frame U — V color and z p hot of our spectroscopic sample 
with a photometric mass-limited sample (> 10 11 M Q ) at 
2 < z < 3. For the latter we use the deep MUSYC 
fields, as the wide NI R data are too shallow to extract a 
mass-limited sample (jvan Dokkum et~aT1l2006h . Accord- 
ing to a Mann- Whitney and a Kolmorov-Smirnov test, 
the photometric properties J — K, R — K, rest-frame 
U — V color (as derived from the photometry) and z p hot 
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Fig. 3. — The differences between the low resolution spectra 
of two observing sequences for the same galaxy (left: 1030-301, 
right: 1256-0) are used to estimate a systematic uncertainty on the 
spectra. In order to improve the consistency between the different 
observing sequences, we increase the original uncertainty per bin 
(black error bars) by a systematic error of 10% (gray error bars) of 
the average flux in the binned spectrum. The fractions of bins that 
are consistent within lcr for the original and increased uncertainties 
are given in the panels in black and gray respectively. 

of the GNIRS sample are representative for a photomet- 
ric mass-limited sample at 2 < z < 3 (see probabilities 
in panels of Fig. [1]). The Mann- Whitney test assesses 
whether the two sample populations are consistent with 
the same mean of distribution, while the K-S test ex- 
amines whether the two samples could have been drawn 
from the same parent distribution. For both tests the 
probability should be greater than the 0.05 significance 
level. 

The galaxies targeted with GNIRS are all bright in K 
(< 19.7). In order to examine if bright galaxies are a 
biased sub-sample of the total mass-limited sample, we 
split the sample in bright and faint members. Figure [1] 
shows that the bright and faint members have the same 
distribution of rest-frame U — V and observed R — K 
colors. The main difference between the bright and faint 
members is the redshift distribution: almost all if -bright 
galaxies have z p hot< 2.3. This also causes their bluer 
J — K colors: in contrast to the i?-band, the J-band 
does not fall entirely bluewards of the optical break for 
z < 2.3. The redshift dependence of J — K is clearly 
visible in Figure [2] However, except for the difference 
in redshift and presumably stellar mass, we see no hints 
that the bright and faint members of a mass- limited sam- 
ple at 2 < z < 3 have different stellar populations. Thus, 
although the spectroscopic sample may have similar stel- 
lar population properties as if -bright galaxies, it is less 
representative for a if -bright sample at 2 < z < 3, as 
the median redshift and its corresponding distribution is 
substantially different. 

2.2. NIR spectra 

We observed the full sample of 36 galaxies with GNIRS 
in cross-dispersed mode, in combination with the 32 




: f o =0.53 




1 




ft 




\ 











1 2 3 

(J-H) phot 



(H-K) phol 

Fig. 4. — Comparison of photometric NIR colors, and those de- 
rived from the spectra, for galaxies in the deep (solid) and wide 
(open) MUSYC fields. Both colors are not corrected for flux con- 
tributions by emission lines. For a few galaxies the spectra are 
too noisy to measure the spectroscopic NIR colors. The fraction 
of galaxies for which the photometric and spectroscopic colors are 
consistent within lcr is given in the top left corner. As these factors 
are less than 0.68, the errors may be slightly underestimated. 

lines mm -1 grating and the / /675 slit. This configu- 
ration resulted in a spectral resolution of R ~ 1000. 
The galaxies were observed during six observing runs in 
2004 September (program GS-2004B-Q-38), 2005 May 
(program GS-2005A-Q-20), 2006 January (program GS- 
2005B-C-12) and 2006 February (program GS-2006A-C- 
6), 2006 December (program GS-2006B-C-5) and 2007 
March (program GS-2007A-C-9). During the first two 
runs most time was lost due to bad weather, and only a 
handful of galaxies was observed under mediocre weather 
condition (seeing ~ 1"). The weather was excellent 
throughout the full 3rd and 4th run, and we reached 
a median seeing of ~0'.'5. The conditions were slightly 
worse during the last two runs, with a median seeing of 
~0'/7, and some time was lost due to clouds. 

We observed the galaxies following an ABA'B' on- 
source dither pattern, such that we can use the average of 
the previous and following exposures as sky frame. This 
cancels sky variation and reduces the noise in the final 
frame. All targets were acquired by blind offsets from 
nearby stars. The individual exposures are 5 minutes for 
the galaxies observed during the first two runs, and 10 
minutes for the remaining runs 13 . The total integration 
times for all galaxies are listed in Table [1] Before and 
after every observing sequence we observe an A V0 star, 
for the purpose of correcting for telluric absorption. The 
final spectra of the two stars were combined to match 
the target's airmass. 

A detailed description of the reduction pro cedure of the 
GNIRS cross-dispersed spectra is given in iKriek etlrtl 

13 An instrument upgrade after the first two runs improved the 
throughput and thus the quality of the spectra, and eliminated 
"radiation events" caused by radioactive coatings. This allowed 
longer integrations 
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Fig. 5. — GNIRS spectra (black squares) and MUSYC broadband photometry (open circles) for all 36 galaxies, sorted for their total 
if-band magnitude starting with the brightest galaxy (see Table HI . Fluxes are given in 10 — 19 ergss -1 cm -2 A -1 . Emission line fluxes are 
removed from the binned spectra, using the best-fit models to the lines. The photometry is not corrected for emission line contamination. 
The best-fit stellar population models to the spectra and photometry are shown in gray. For emission-line galaxies the redshift was fixed 
at zn nc during fitting, while for the remaining galaxies z was a free parameter. 
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lj2006aH . In summary, we subtract the sky, mask cos- 
mic rays and bad pixels, straighten the spectra, combine 
the individual exposures, stitch the orders and finally 
correct for the response function. ID spectra are ex- 
tracted by summing all adjacent lines (along the spatial 
direction) with a mean flux greater than 0.25 times the 
flux in the central row, using optimal weighting. We also 
constructed "low resolution" binned spectra from the 2D 
s pectra for each galax y following the method as described 
in lKriek et al.l (|2006af ) . Each bin contains 80 "good" pix- 
els (i.e., wavelength regions with high atmospheric trans- 
mission and low sky emission), corresponding to 400 A 
per bin. Sky, transmission, and noise spectra were con- 
structed for each galaxy as well. 

We assess the uncertainties on the low resolution spec- 
tra by splitting the data in two sequences for several 
objects and comparing the results. We find that the er- 
rors as derived from the photon noise underestimate the 
true uncertainty for most galaxies. In Figure [3] we show 
two examples. In order to obtain a better consistency 
between the observing sequences, we increase the uncer- 
tainties for all bins by quadratically adding 10% of the 
average flux in the spectrum. 

We use the broadband NIR photometry to perform the 
absolute flux calibration. For each galaxy we integrate 
the spectrum over the same </, H and K filter curves 
as the photometry. We determine one scaling factor per 
galaxy, from the difference between the spectroscopic and 
photometric NIR magnitudes, and use this factor to scale 
the NIR spectrum. We note that at this stage both the 
spectra and the photometry may contain flux contribu- 
tions by emission lines. We extend our wavelength cov- 
erage by attaching the optical photometry to the scaled 
spectra. For the emission-line galaxies we subsequently 
remove the line fluxes from the affected bin s, using the 
best-fi t to the emission lines as derived in iKriek et al.l 
l(2007h . 

As a quality check we compare the photometric NIR 
colors J — H, J — K and H — K to those derived from the 
spectra. The direct comparison for the individual galax- 
ies is presented in Figure As expected, the scatter 
is larger for the galaxies with shallow NIR photometry. 
The fraction of galaxies for which the spectroscopic and 



photometric colors are consistent within la are listed in 
the panels. For all colors these fractions are slightly lower 
than expected, suggesting that the uncertainties may be 
slightly underestimated. This may be partly due to color 
gradients in the galaxies, as we use different apertures to 
determine the colors. The spectroscopic apertures are 
rectangular, and depend on the extraction radius and 
method, which differs per galaxy. On average the spec- 
troscopic apertures are 0'.'675 by l'/2, slightly smaller 
than the circular photometric apertures which have a 
diameter of ~1'.'4. 

The final low resolution spectra and broadband pho- 
tometry for the full sample arc presented in Figure O 
The galaxies are ordered by total K magnitude (see Ta- 
ble QJ, starting with the brightest galaxy. Remarkably, 
there is not a strong trend between quality of the spec- 
trum and the total if-band magnitude for the same inte- 
gration time. We suspect that the surface brightness of 
the object plays an important role, as bright objects with 
low quality spectra and typical integration times, such as 
ECDFS-4511 and ECDFS-9822, are extended even in the 
MUSYC imaging, which have a image quality of ~ 1". 

3. SPECTROSCOPIC VERSUS PHOTOMETRIC REDSHIFTS 
3.1. Spectroscopic Redshifts and Galaxy Properties 

For 19 of the galaxies in the sample we detected one 
or more emission lines, and thus for these galaxies we 
could determine exact spectroscopic redshifts. The re- 
maining galaxies may have no or very faint emission 
lines, or the lines are expected in atmospheric wavelength 
regions with low transmission or strong sky emission. 
Fortunately, we can derive fairly precise redshifts from 
the continuum emission alone, mainly due to the pres- 
ence of the Balme r/4000 A break in the NIR spectra 
(|Kriek et al.l l2006a) . For none of the galaxies absorption 
lines are detected. 

We fit the low resolution bi nned spectra togeth e r with 
the optical photometry by iBruzual fc Charlotl (2003) 
stellar population models. We allow a grid of 24 different 
ages (not allowing the galaxy to be older than the age 
of the universe), and 31 different exponentially declin- 
ing star formation histories (SFHs) with the characteris- 
tic timescale (r) varying between 10 Myr and 10 Gyr. 
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Fig. 6. — In this figure we illustrate the accuracy of continuum 
rcdshifts, by deriving 2 Gon t f° r th e 19 emission-line galaxies in the 
sample. The scatter and the systematic offset in Az/(1 + z) are 
listed in the figure. The continuum redshifts are ~ 4 — 7 times 
more accurate than photometric redshifts, and show no significant 
systematic offset. Galaxies without emission lines generally have 
larger breaks, so their 2: con t may even be more accurate. 

We leave redshift as a free parameter for the galax- 
ies without emission lines. Furthermore, we adopt the 
ICalzetti et al.l (|2000() reddening law and allow 41 values 
for Ay between en 4 mag. We compute the \ 2 sur ~ 
face as function of all stellar p opulation param eters. For 
all grid points we a ssume the [Salpeterl (U955) IM F, and 
solar metallicity. A IChabrierl <[2003f) or a iKroupal (|2001l ) 
IMF yield stellar masses and SFRs which are a factor of 
~ 2 lower. The mass differences when using the stellar 
population library by iMarastonl (120051) are discussed in 
iKannappan fc Gawiserl (|2007l) and lWuvts et all (|2007h . 

We derive la confidence intervals on the redshifts and 
stellar population properties using 200 Monte Carlo sim- 
ulations. We vary all bins of the low-resolution spectra 
according to their uncertainties, and fit the simulated 
spectra using the same procedure as described above. 
Next, we determine the contour in the original x 2 sur- 
face that encompasses 68% of the Monte Carlo s imula- 
tions (see iPapovich et~aHl200alKr'iek et al.l l2006a'). The 
la confidence intervals for all properties are the mini- 
mum and maximum values that are allowed within this 
X 2 contour. 

For the emission-line galaxies, we removed the emis- 
sion line fluxes from the spectra before fitting. This is 
differen t to our previous method presented in lKriek et al.1 
(|2006af ) in which we mask the bins that are contami- 
nated by emission lines. The difference in modeling re- 
sults (although consistent within the errors) compared to 
iKriek et~a l. (2006b, 20Q3) are due to this improvement 
and updates of the broadband photometry catalogs. All 
spectroscopic redshifts and corresponding stellar popula- 
tion properties are listed in Table [2] 

In order to test the accuracy of the continuum red- 
shifts, we also fit the emission-line galaxies with redshift 
as a free parameter. In Figure [6] we compare the emis- 
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Fig. 7. — In this diagram we examine the cause of errors in con- 
tinuum redshifts. As the modeling is mainly driven by the optical 
break, we expect and indeed find less accurate continuum redshifts 
for galaxies for which a large part of the break falls between atmo- 
spheric windows or outside the spectrum (panel a, shaded regions). 
These galaxies are indicated by gray symbols in all panels. In pan- 
els b and c we show that for galaxies with bluer SEDs, and thus 
weaker optical breaks, the continuum redshifts are less accurate. 
There is no clear correlation with the S/N of the spectrum in panel 
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Fig. 8. — left: Galaxy templates as used by the original photo- 
metric redshift code, right: Galaxy templates as constructed from 
the GNIRS sample. 

sion line redshifts with the continuum redshifts. We find 
a scatter of Az/(1 + z)= 0.019 and no significant system- 
atic offset. In Figure [7] we examine causes of the errors. 
First, as the modeling is driven by the optical break, we 
expect this method to be less accurate if the break falls 
between atmospheric windows. In the Figure [T^t we in- 
deed find that galaxies for which the break falls outside 



8 



Kriek et al. 



TABLE 2 

Spectroscopic modeling results 
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Note. — The stellar p opulation properties are derived by fitting the low resolution spectra together with the optical photometry by 

bruzual fc Charloj J2OO3I) stellar population models. For the emission-line galaxies the rcdshift was fixed to z spcc . The errors represent the 

68% confidence intervals derived using 200 Monte Carlo simulations. 

a The continuum redshifts are those for which we give the confidence intervals. 



the spectrum or in between the J and H band have less 
accurate z C ont- 

Furthermore, we expect the continuum redshifts to be 
more accurate for galaxies with strong optical breaks. In 
the Figures _J> and c we show the correlation with rest- 
frame U — V color and observed R — K color. If we ex- 
clude the galaxies for which the break falls between atmo- 
spheric windows {gray symbols), we indeed find that blue 
galaxies with weak optical breaks, have less accurate con- 
tinuum redshifts. The continuum redshifts for galaxies 
without emission lines might even be more accurate, as 



these g alaxies generally have larger breaks (jKriek et al.l 
l2Q06bQ . 

Finally, in Figure [31 we examine the correlation be- 
tween (,z con t — £iine)/(ziine + 1) and the S/N per bin. Re- 
markably, we do not find an obvious trend. However, 
the different causes for errors and the small size of the 
sample may have affected a possible correlation. 

3.2. Photometric Redshifts and Galaxy Properties 

Photom etric redshifts are derived usi ng the method de- 
scribed bv iRudnick et all (|2001l l2003f) . The code fits a 
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Photometric modeling results 
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Note. — The stellar population properties arc derived by fitting the photometry by Bruzual & Chariot ( 2003) models, fixing the rcdshift to 

-Sphot- The Zphot confidence intervals are derived using 100 Monte Carlo simulations Jl^i^niTl^t^alTT2^ull[2ll03h . We use the same simulations 

in combination with the best-fit r p i luL for parb simulation to derive the la (68%) confidence intervals of the stellar population properties. 
a Derived using method bv lRudnick et al.l i hoO A I2OO3I) 



linear non-negative superposition of the 8 templates pre- 
sented in the left panel of Figure [5] to the broadband 
photometry using x 2 minimization. This template set 
consi st of the empirical E, Sbc, Scd an d Im templates 
from IColeman. Wu &: Weedmar] (|1980l), the two leas t 
reddened starburst templ ates from[Kinnev et alJ lll996f ). 
and a 1 Gyr and 10 Myr iBruzual k Charlotl (|2003f ) sin- 
gle stellar population with a lSalpeteri ( 19551 ) initial mass 
function (IMF). Confidence intervals on z pno t are deter- 
mined using 100 Monte-Carlo simulations, in which the 
broadband photometry is varied according to the photo- 



metric uncertainties. 

We determi ne photometric stell a r pop ulations proper- 
ties by fitting [Bruzual fc Charlotl (|2003| ) stellar popula- 
tion models to the broadband SEDs, with the redshift 
fixed to the be st-fit z D hot- We appl y the same grid and 
reddening law (Calzctt i et alJ 12000? ) as for the spectro- 
scopic fitting (§ 13. ip . The 100 simulations of the pho- 
tometry in combination with their best-fit z p hot are used 
to derive the confidence intervals on the stellar popula- 
tion properties (see procedure as described in § 13. 1|) . 

The errors on the photometric redshifts and rest- frame 



10 



Kriek et al. 



3.5 - 



3.0 - 



2.5 ■ 



2.0 ■ 



1.5 ■ 




Shallow NIR photometry 
Deep NIR photometry 



« G 

C 3 
O 3 



H (J 

o □ 



20 


- All 




Az/(l+z) - 






scatter: 0.13 


15 






offset: 0.08 


10 








5 


















14 - All 
12 - — SPEC 

10 - PH0T 




Deep 



Az/(l+z) 
scatter: 0.08 
offset: 0.03 





1.5 



2.0 



2.5 



3.0 



3.5 



1.0 1.5 2.0 2.5 3.0 3.5 



1.0 1.5 2.0 2.5 3.0 3.5 



Fig. 9. — Spectroscopic vs. photometric redshifts. The scatter 
and offset in Az/(l + z) for the full sample are 0.13 and 0.08 respec- 
tively. The squares and circles represent the continuum and emis- 
sion line redshifts respectively, for galaxies in the deep (filled sym- 
bols) and wide (op en symbols) MUSYC fields. DRGs (J - K > 2.3, 
IFranx et aI|[2003T ) are indicated in red and non-DRGs are shown 
in blue. Corresponding systematic and random errors are given in 
Table H 



TABLE 4 

Comparison between photometric and spectroscopic 

REDSHIFTS (Az/(1 + z)) 



All 



All 



Deep 



Wide 





scat 


offs 


scat 


offs 


scat 


offs 


All 


0.13 
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^cont 
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^lines 


0.10 


0.07 


0.06 


0.03 


0.24 


0.16 


DRGs 


0.11 


0.08 


0.07 
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DRGs, 2 linos 
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non-DRGs 


0.13 
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non-DRGs, zii ncs 


0.14 


0.09 


0.16 


0.11 


0.09 


0.06 



Note. — In this table we give the scatter and systematic 
offset (A^/(l + z)) between the photometric and spectroscopic 
redshifts for the full sample, and different subsamplcs. 

lu minosities are d e rived i nde pendently using the method 
bv lRudnick et all (|200lL I2003D . Due to the independent 
fitting procedures, we do not have a combined \ 2 distri- 
bution for M and Ly- For the confidence intervals on 
M/Ly we take the minimum and maximum M/Ly cor- 
responding to the 68% best stellar population fits, i.e., 
with the lowest \ 2 ■ All photometric redshifts, stellar 
population properties and corresponding confidence in- 
tervals are listed in Table EB 

3.3. Direct Comparison 

In Figure [5] we compare z spcc and z p hot for all indi- 
vidual galaxies. We find a scatter of Az/(1 + z)= 0.13 
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Fig. 10. — Spectroscopic (black solid histograms) and photo- 
metric (diagonally hatched histograms) redshift distributions (top 
panels) and their summed probability distributions (middle panels) 
for the full spectroscopic sample (left panels), and for those galaxies 
with deep NIR photometry (right panels). The scatter and system- 
atic offset in Az/(1 + z) are given in the top right corner in the top 
two panels. The lower panels show the distribution of Az/(1 + z) 
for the full (left) and deep (right) sample respectively. 

and a systematic offset of Az/(1 + z)= 0.08, such that 
our photometric redshifts are on average too high. These 
values are 0.08 and 0.03 respectively for galaxies in the 
MUSYC deep fields only (see Table BJ). The distributions 
of the spectroscopic and photometric redshifts for the 
full and the deep sample, are shown in Figure [TOl Using 
optical spectroscopy and the same ph otometric redshift 
code and templates, IWuvts et alj (|2008l ) find a scatter of 
Az/(l+z)~ 0.06 and a median offset of Az/(l + z)~ 0.02 
for galaxies with deep NIR photometry in the same red- 
shift range. This result is similar to ours when comparing 
only to the galaxies with deep NIR photometry. 

The broadband photometric redshifts of galaxies with 
emission lines have smaller errors than those of galaxies 
without emission lines, especially when we exclude the 
galaxies with shallow NIR photometry. However, our 
sample is small, and studies of larger spectroscopic sam- 
ples are needed to verify this result. If this result still 
holds for larger samples, it probably implies that the 
quality of photometric redshifts, which are calibrated us- 
ing just emission-line redshifts, may be overestimated. 

The broadband photometric redshifts of galaxies with 
emission lines have smaller errors than those of galax- 
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Fig. 11. — Broadband photometry of all galaxies for which the photometric and spectroscopic redshifts are substantially different 
(Az/(1 + z)> 0.1). The dark gray fit shows the best-fit stellar population model to just the broadband photometry. The best-fit stellar 
population model to the spectrum in combination with the optical photometry is presented by the light gray fit. Both the photometric and 
spectroscopic (emission-line or continuum) redshifts are given in the panels. 



ies without emission lines (see also Table 2]), especially 
if we exclude the galaxies with shallow NIR photome- 
try. However, our sample is small, and studies of larger 
spectroscopic samples are needed to verify this result. 
If this result still holds for larger samples, it probably 
implies that the quality of photometric redshifts, which 
are calibrated using just emission-line redshifts, may be 
overestimated. We stress, however, that galaxies with 
emission-line and continuum redshifts do not precisely 
present two distinct classes within this sample, as for 
certain redshift intervals emission lines are expected in 
between atmospheric windows. 

Whereas spectroscopic redshifts for blue galaxies can 
be obtained by optical spectroscopy as well, for most 
red, optically faint galaxies NIR spectroscopy offers the 
only possibility to obtain a z spec . Previous studies, our 
photometric redshift estimates among them, were mainly 
calibrated using optical spectroscopy. Therefore, it is 
interesting to know whether red and blue high-redshift 
galaxies show similar scatter and systematics. We divide 
the full sample into two classes using the DRG criterion 
(J — K > 2.3. lFranx et al.ll2003D . In Figure M we indicate 
the DRGs and non-DRGs by red and blue symbols re- 
spectively. We find no significant difference in the scatter 
or systematic offset in Az/(l+z) for blue and red galaxies 



for the total sample. When only considering the galaxies 
with deep NIR photometry, DRGs seem to have more ac- 
curate photometric redshifts than non-DRGs. We note, 
however, that due to the redshift dependence of the J—K 
color (see Fig. the trend between Az/(l + z) and 2; spe c 
(which is a logical result of our Zphot selection) compli- 
cates the interpretation of this test. In the next section 
we will use rest- frame colors in order to avoid this effect. 

3.4. Systematics and Catastrophic Failures 

In this section we will examine the different causes for 
systematics and catastrophic failures in Az/(1 + z) as 
identified in the previous subsection. We define failures 
as galaxies for which Ax/(1 + z)> 0.1. In Figure [TT] 
we show the broadband photometry and best-fit stellar 
population models, in- or excluding the spectral infor- 
mation, for all failures. However, for several galaxies the 
photometric redshift is poorly constrained - for example 
due to degeneracies - and despite the large discrepancy, 
agrees reasonably well with the confidence inter- 



val of 



^phot ■ 



Thus we discriminate between failures for 



which z spcc is or is not consistent within 3 a with z p hot, 
and define these as "good failures" and "bad failures" 
respectively. This is illustrated in Figure [T2"k . in which 
we show Az/(1 + z) versus the difference between z p hot 
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Fig. 12. — In this figure we examine systematics and catastrophic failures in Az/(1 + z). Squares and circles represent galaxies with 
continuum and emission-line redshifts respectively, in the wide (open symbols) and deep (filled symbols) MUSYC surveys. Galaxies with 
Az/(1 + z)< 0.1 are represented by the black symbols. In panel a we classify failures (Az/(1 + z)> 0.1) such that galaxies for which 
Zspcc is within 3cr consistent with z p hot are defined as "good failures" (red), while galaxies that do not meet this criterion are defined 
as "bad failures" (blue). Furthermore, we relate (z p hot — Zspec)/(1 + z spcc ) with the S/N of the NIR photometry (b), the difference 
between the photometric and spectroscopic J — H colors (c), the contamination of the broadband photometry by emission lines (d), total 
iC-band magnitude (e), rest-frame U — V color (/), and the stellar population properties Ay (g) and age (h). This figure illustrates that 
( 2 phot — z spcc )/(l + z spcc ) is generally larger for galaxies with shallow NIR photometry, and mainly correlates with SED type. 



and 



'spec 



in a. For one galaxy with Az/(1 + z)< 0.1 



(1256-519), ■Zphot and z S p ec are not consistent within 3 o . 

FigurefT2b shows that all bad failures have shallow NIR 
photometry, with an average S/N in H and K of < 10. 
This illustrates that deep NIR photometry is needed to 
get meaningful photometric redshifts and corresponding 
confidence intervals. Not all galaxies with shallow NIR 
photometry are bad failures, or even failures at all, but 
the probability that a galaxy with shallow NIR data has a 
Zphot which is inconsistent with the spectroscopic redshift 
is substantially larger than for galaxies with deep NIR 
data. 

As good failures have similar Az/(l+z) as bad failures, 
their confidence intervals are by definition larger. Large 
confidence intervals may be caused by shallow NIR data 
or model degeneracies. Several good failures indeed have 
shallow NIR photometry, and their photometric redshifts 
are - possibly by coincidence - consistent with their spec- 
troscopic redshifts. The deviating photometric redshifts 
of high S/N failures must have a different origin. In the 
following we examine several potential causes for good 
failures: 

Deviating NIR colors: In Figure 112b we relate (z p hot — 
£spec)/(l + Zspcc) with the difference between the pho- 



tometric and spectroscopic J — H color (see i)2.2[) . As 
the optical break falls between the J- and ff-band for 
z = 2 — 3 galaxies, a deviating J — H color may affect 
the optical break identification, and thus may place a 
galaxy at the wrong redshift. This may have happened 
for 1030-609 and 1030-2927 (see Fig.©. We note that a 
few galaxies are missing from Figure [T2"b as their spectro- 
scopic NIR colors could not be determined (see Fig. [4]) . 

Emission-line contamination: Contamination by emis- 



sion lines generally reddens the rest-frame optical colors, 
which may result in a wrong break identification. Fig- 
ure [T2H shows that for several galaxies the photometric 
NIR colors appear redder due to the flux contribution by 
emission lines. However, there is only one galaxy (CDFS- 
6026, see Fig. [SJ with deep NIR photometry, for which 
the emission line contamination may be the main cause 
for a wrong z p hot- 

K band magnitude: Due to the shape of the luminos- 
ity function, the fraction of low to high redshift galax- 
ies is much larger at brighter K magnitudes. Thus, it 
is expected that the interloper fraction is larger at the 
bright end. In Figure [T^b we indeed see that the fraction 
of good failures compared to the total sample increases 
at brighter K magnitudes. This effect can be reduced 
by using a luminosity prior in the z p hot fitting code, to 
break the degeneracy in favor of the correct low-redshift 
solution. 

Correlations with SED types: In Figures I12f , g and h 
we examine correlations with SED type by comparing 
(zphot - z spcc )/(l + z S p C c) with rest-frame U — V color, 
best-fit age and dust content (all derived from the spec- 
troscopic information). We use rest-frame U — V colors, 
instead of observed J — K colors, as the latter are depen- 
dent on redshift (see § 13. 3|) . We find no obvious corre- 
lation between (z p hot — z spec )/(l + z S pcc) and rest- frame 
U — V color. Nonetheless, Figures [12g and h suggest a re- 
lation with stellar population properties, such that young 
and/or dusty galaxies have a higher probability of hav- 
ing a wrong z p hot ■ These galaxies are in our photometric 
redshift code generally best-fit by the 1 Gyr dust-free 
template (Fig. |5J), which places them at the wrong red- 
shift (see also § 14. ip . One of these galaxies is 1256-519, 
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Fig. 13. — In this figure we test the different causes for scatter 
and systematics in Az/(1 + z). In panels a we show the original 
2 phot an d m panels b we use the new templates as shown in Figure|8] 
to derive photometric redshifts. In panel c we have repeated the 
original fitting procedure, replacing the NIR photometry by the 
spectroscopic J, H and K fluxes. In panel d we remove the emis- 
sion line fluxes from the broadband photometry before applying 
the photometric redshift procedure. Circles and squares represent 
galaxies with emission line and continuum redshifts respectively. 
For galaxies with deep NIR photometry the symbols are filled. The 
scatter and systematic in Az/(l + z) are given in black for the high 
S/N galaxies and in gray for the total sample. This figure illus- 
trates that the systematic offset in Zphot can almost completely be 
removed and the absolute errors reduced by using well calibrated 
templates. 

for which z p hot and -Z S pe C are not consistent within 3 a 
(although Az/(1 + z)<0.1). This may suggest that ap- 
propriate templates (i.e., dusty ones) are missing in our 
Zphot procedure. We will examine this in more detail in 



All discussed potential causes generally overestimate 
photometric redshifts, and place the galaxy at too high 
redshift. Altogether, this introduces a systematic effect. 
In the next section we will examine the influence of the 
different causes, by attempting to correct for them. 

3.5. An Empirically-Motivated Template Set 

In order to test the different causes found for the sys- 
tematics and failures in Az/(1 + z), we refit the pho- 
tometry attempting to correct for these effects. First we 
construct a new template set from our GNIRS sample. 
By eye we pick six best-fit SEDs from Figure [51 to be 
representative for the z ~ 2.5 massive galaxy popula- 
tion. These new templates are presented in the right 
panel of Figure [3 We refit the broadband photometry 
of all galaxies only allowing these 6 templates, without 
superposition and luminosity prior, in steps (Az) of 0.02. 

In Figures 113b and b we show the comparison between 
the old and new z p hot) respectively, versus z S pcc- The 
galaxies with high S/N NIR photometry are presented 
by filled symbols. The new templates reduce both the 
systematics and scatter. Remarkably, the improvement 
is largest for galaxies with low S/N photometry. 

Although, the newly defined template set yields more 
accurate photometric redshifts than the original template 



set, it provides no general solution. It is constructed 
to work especially for massive galaxies at 2 < z < 3, 
and thus they may not be appropriate when applied to 
lower redshift or less massive galaxies. Nevertheless, this 
test illustrates the importance of using well-calibrated 
templates. Similar template mismatches may also apply 
to other galaxy populations, and further exploration is 
needed in order to better calibrate high-redshift studies. 

In order to test whether errors in the NIR broadband 
photometry may be responsible for systematics or may 
lead to an increase in the scatter, we attempt to correct 
the photometry using the NIR spectra. In Figure [L3"b 
we replace the J, H and K by the spectroscopic col- 
ors and repeat the original fitting procedure, using the 
original templates. The spectroscopic colors do not lead 
to an improvement, and in fact even lead to more pro- 
nounced scatter and systematics, in particular for the 
galaxies with deep NIR photometry. This might not be 
surprising, as for these galaxies the S/N of the J — H 
photometric colors is generally higher than the S/N of 
the spectroscopic colors. 

Furthermore, we remove the emission line fluxes from 
the broadband photometry before repeating the original 
fitting procedure. Figure[T3H shows that this does not re- 
duce the scatter or the systematics. This may be because 
several of the original templates contained emission lines. 
Also, the massive galaxies studied in this work all have 
comparatively weak emission lines, and contamination 
may play a much larger role for less massive galaxies. 

4. IMPLICATIONS 

The spectroscopic redshifts and higher resolution spec- 
tral shapes as obtained by the NIR spectra provide better 
constraints on the stellar population properties. This al- 
lows us to test the significance and reliability of the pho- 
tometrically derived properties, and examine the com- 
pleteness and systematics of photometric samples. In 
§ 14.11 we quantify the improved constraints on the stel- 
lar population properties, and identify the driving force 
behind this improvement. In § 14.21 we test photometric 
samples, and finally in § 14.31 we assess previous studies 
based on photometric samples. 

4.1. Implications for Derived Properties 

In this section we compare the photometric and spec- 
troscopic stellar population properties, as listed in Ta- 
bles [2] and [3l Note that the photometric properties are 
derived using the original photometric redshifts (§ 13. 2|) . 
FigurefXH illustrates the effect on the properties age, Ay, 
r, stellar mass and SFR when including the spectroscopic 
information. In the left panels we show the comparison 
for individual galaxies. The panels in the second column 
present the distributions of the photometric and spec- 
troscopic properties. In the third column we show the 
distribution of the difference between photometric and 
spectroscopic properties. Finally in the panels in the 
fourth column we show the ratio of the photometric and 
spectroscopic ler confidence intervals. 

The differences between the best-fit photometric and 
spectroscopic properties in the left diagrams of Figure flU 
are striking. Ay has a scatter of ~1 mag, and age, r and 
SFR each have a scatter of a factor of ~3-4, between the 
photometric and spectroscopic values. Despite the large 
scatter, for a high fraction (75-89%) of the galaxies the 
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Fig. 14. — Comparison of the photometric and spectroscopic properties age (since the onset of star formation), Ay, r, stellar mass, 
SFR, M/Ly ratio, rest-frame V magnitude, and rest-frame U-V color. The left panels show the direct comparison for the individual 
galaxies with the symbols similar as in Figure [12] The fraction (F c ) of galaxies for which the photometric and spectroscopic properties 
are consistent within la is given in the top left corner, and the typical error in the bottom right. The dotted diagonal lines indicate 
where the spectroscopic properties are equal to the photometric properties. The panels in the second column show both distributions. 
The typical errors for the photometric and spectroscopic properties are given in the top right corner. The distribution of the difference 
between the photometric and spectroscopic properties is presented in the panels in the third column. The offset (biweight mean) and scatter 
(normalized biweight mean absolute deviation) of the distribution is given for each property for the total and deep sample. In the right 
panels we present the distribution of the ratio of the photometric and spectroscopic confidence intervals for the individual galaxies. The 
black histograms represent all galaxies, and the colored, diagonally hatched areas correspond to the galaxies with deep NIR photometry. 
This figure illustrates that the stellar population properties such as the age, Ay, SFR, and r are generally very poorly constrained with 
broadband data alone. Interestingly, stellar masses and M/Ly are among the most stable properties. 
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Fig. 14. — Continued 



photometric properties are consistent with the spectro- 
scopic properties within 1 a. This demonstrates the relia- 
bility of the large confidence intervals in Tabled and sup- 
ports that the best-fit photometric properties age, Ay, t 
and SFR are poorly determined for individual galaxies. 
The spectroscopic properties are better constrained, but 
the errors are still substantial, with a factor of ~2 for 
age, tau, and SFR and ~0.4 mag for Ay. Thus, it is not 
surprising that the stellar population properties show a 
large scatter, even when the photometric redshift is close 
to the spectroscopic redshift. 

Although the photometrically and spectroscopically 
derived best-fit values are consistent, the photometric 
and spectroscopic distributions show systematic differ- 
ences (Fig. [TH 2nd column). In § 13. 41 we identified possi- 
ble correlations between Az/(1 + z) and stellar popula- 
tion properties. Due to the lack of appropriate templates, 
the stellar population properties seem to "converge" to 
the properties of the template that provides the best fit 
during the z p hot procedure. As many galaxies are best- 
fit by the dust-free 1 Gyr template, we see a peak at 
Ay = and an age of 1 Gyr for the photometry. The 
Ay — peak completely disappears, and the age peak 
shifts to lower ages when including spectroscopic infor- 
mation. This suggest that the distribution of best-fit 
photometric stellar population properties, as presented 
in the 2nd column of Figure [T4l may be strongly biased. 

While it is clear that spectroscopic data significantly 



improve the constraints on the stellar population proper- 
ties, it is interesting to know whether the improvements 
are primarily due to the precise redshift determination or 
whether they are also caused by the spectroscopic mea- 
surement of the continuum shape. We test the impact 
of both effects by modeling the broadband photometry 
by stellar population models (see procedure § I3.ip , fix- 
ing the redshift to z spcc . In Figure [TS] we disentangle 
the two effects, by comparing on the horizontal axis the 
improvement of the stellar population properties due to 
the higher resolution spectral shape (i.e., new model- 
ing results versus spectroscopic properties) and on the 
vertical axis the effects of the better constrained red- 
shift (i.e., photometric properties versus new modeling 
results). The corresponding scatter and systematics are 
listed in Tabled Remarkably, the results for age and r 
are more affected by the higher resolution spectral shape, 
rather than by the better constrained redshift. For SFR 
and Ay both effects are equally important. 

Photometric stellar mass is significantly better con- 
strained, with a scatter of ~ 2. Nevertheless, this prop- 
erty is generally overestimated by a factor of ~ 1.4. Even 
if we exclude the galaxies with low S /N NIR photometry, 
we still obtain photometric stellar masses which are on 
average a factor of ~ 1.3 too high. This overestimation is 
lower than the factor of 2 found bv lShaplev et al.l (2005) 
for blue galaxies. However, the values are not directly 
comparable, as their mass ratio is derived by using spec- 
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Fig. 15. — In this figure we examine the influence of both the better constrained redshift and the higher resolution spectral shape on the 
stellar population properties (see also Table [5). The y-axes present the difference between best-fit properties by including spectroscopic 
redshifts. We derived this improvement by fitting the photometry replacing Zphot by z S p C c. On the x-axes we show the additional 
improvement due to the higher resolution spectral shape. This improvement is the difference between the best- fit v alues for fitting the 
photometry with the redshift fixed to 2 S pec, in- and excluding the NIR spectrum. Symbols are similar as in Figure [12] 
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0.05 


0.47 




deep 


-0.12 


0.88 


-0.11 


0.35 


-0.02 


0.45 


T 


all 


0.89 


4.18 


0.93 


3.00 


1.00 


4.77 




deep 


0.80 


3.13 


0.82 


2.11 


0.76 


5.74 


mass 


all 


1.38 


2.23 


1.11 


1.78 


1.14 


1.89 




deep 


1.28 


2.05 


1.11 


1.72 


1.06 


1.78 


SFR 


all 


0.81 


4.13 


0.83 


4.13 


0.98 


2.63 




deep 


0.88 


3.51 


0.94 


3.49 


0.78 


3.00 


M/L v 


all 


1.05 


2.06 


0.97 


1.78 


1.09 


1.75 


deep 


1.00 


1.93 


0.98 


1.76 


1.05 


1.77 


V 


all 


-0.38 


0.60 


-0.34 


0.56 


-0.01 


0.07 




deep 


-0.21 


0.40 


-0.19 


0.38 


-0.00 


0.04 


(U-V) 


all 


-0.06 


0.30 


-0.06 


0.21 


0.02 


0.08 


deep 


-0.08 


0.20 


-0.06 


0.14 


0.01 


0.06 



Note. — In this tabic wc list for all properties the scatter and 
systematic offset between the photometrically and spectroscop- 
ically derived properties (Q p h(2 p hot) vs - Qsp( 2 spec))- Further- 
more, we examine whether this scatter between the photometric 
and spectroscopic properties is mainly driven by the precise red- 
shift measurement (Qph(zphot) vs. Q p h( 2 spcc)), or the higher 
resolution spectral shape (Qpht^spcc) vs. Q S p(z spcc )) 

troscopic or photometric redshifts when fitting UGRK 
and IRAC photometry. In Figure [15] we find that the 
improved constraints on stellar mass is evenly driven by 
a better constrained redshift and the higher resolution 
spectral shape (see also Table [5]). Thus, spectroscopic 
redshifts combined with broadband photometry may still 
overestimate the stellar mass, although by a small factor 
of ~ 1.1. 



In Figure [TJ] we also show the comparison between the 
photometrically and spectroscopically derived M/Ly, 
rest-frame magnitude V and rest-frame U — V color. Of 
all photometric properties Mj Ly is the best determined, 
and shows no systematic offset. Rest-frame U — V colors 
are in general underestimated by ~ 0.1 mag, and show a 
scatter of ~ 0.2 for the galaxies with deep NIR photome- 
try. Figure [15] shows that the systematica and scatter are 
mainly due to the lack of a spectroscopic redshift. This 
effect is even worse for rest-frame V magnitudes. As ex- 
pected, rest-frame V magnitudes can very accurately be 
determined once the redshift is known. Thus, for pho- 
tometric redshifts this quantity is less well constrained 
with an offset of ~ 0.2 and a scatter of ~ 0.4 for the 
deep NIR photometry. 

Better constrained properties and confidence inter- 
vals can also be obtained by extending the broad- 
band SFdDs_b^jrndj^fnu^ 

(e.g.JLabbe et alJl2005HWuvts et al.ll2007f) . iWuvts et al] 
(]2007l ) show that the distribution of stellar population 
properties does not change significantly when IRAC data 
are added to their UB VI JHK photometry. However, for 
individual galaxies the addition of IRAC can improve 
the constraints on the stellar populations significantly 
As we do not have photometric catalogs including the 
IRAC bands for the GNIRS galaxies yet, it is difficult 
to examine whether the IRAC bands have a similar ef- 
fect as the higher resolution spectral shape on the stellar 
population properties. 

Finally, we stress that the derived photometric and 
spectroscopic properties are not completely independent, 
as we use the same optical photometry for both, and the 
NIR spectra are calibrated using the NIR photometry. 
Furthermore, the confidence levels do not include all un- 
certainties, as all photometrically and spectroscopically 
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Fig. 16. — Comparison of photometrically (open diamonds) and spectroscopically (filled squares) derived properties of the full spectroscopic 
sample. The top panels show stellar mass, rest-frame U — V color, observed J — K and R — K color versus redshift. Hence all galaxies 
are plotted twice. The J — K and R — K colors are in both cases photometric, while for the other properties (z, U — V and stellar mass) 
we adopt the spectroscopic or photometric values. The bottom panels show for both the distribution of the stellar mass, rest-frame U — V 
color, observed J — K and R — K color for galaxies in the same redshift range, and for the latter three, the same mass range. The arrows 
indicate the median values of the distributions. The typical errorbars are shown in the bottom and top right of the top and bottom panels 
respectively. The barely changed distributions of J — K and R — K imply that the spectroscopy supports the previous finding that red 
galaxies dominate the high mass end at 2 < z < 3. 

derived stellar population proper ties are based o n the 
same assumptions, among which a ISalpeteil (|1955h IMF, 
solar metallicity, an expon entially declining SFH, and the 
iBruzual fc Chariot] (|2003f ) stellar population models. 

4.2. Implications for Photometric Samples 

In the previous section we examined the improvement 
of stellar population properties when including the spec- 
troscopic information. However, due to the different 
redshift distributions, the photometric and spectroscopic 
properties as presented in Figure fl4l should not be com- 
pared directly to examine the implications for galaxy 
samples. Due to degeneracies between redshift and stel- 
lar population properties, the systematic effects may 
even be worse for complete samples, than for individ- 
ual galaxies. Thus, it is important to consider the same 
redshift interval in order to examine the systematics of 
photometric samples. 

In the top panels of Figure [T|)] we show the photomet- 
ric and spectroscopic properties stellar mass, rest-frame 
U — V color, and observed J — K and R— K color versus 
redshift of the total GNIRS sample. The photometri- 
cally derived properties are presented by open diamonds 
and the filled squares represent the spectroscopically de- 
rived properties. Hence, all galaxies are shown twice. 
This figure illustrates how photometric and spectroscopic 
properties of galaxies in a certain redshift interval differ, 
and which galaxies fall out and in the targeted redshift 
interval (2.0 < z < 2.7) when spectroscopic informa- 
tion is included. For example, several of the galaxies for 
which the photometric redshift and hence stellar mass 
were substantially overestimated, move to z < 2, and 
thus will not have a large effect on the properties of 
the 2.0 < z < 2.7 sample. We note, however, that our 



spectroscopic sample is not complete, as galaxies with 
•Zphot < 2.0 and z p hot > 2.7 may scatter into the spectro- 
scopic sample. Although the photometric sample con- 
tains 10 galaxies with z p hot > 2.7, these galaxies may 
not be representative for all galaxies with z p hot > 2.7 
and 2.0 < z spoc < 2.7. 

In the bottom panels in Figure fTHl we compare the pho- 
tometric and spectroscopic distributions of stellar mass, 
J - K, R — K and rest-frame U - V of the 2.0 < z < 2.7 
galaxy sample. The first panel illustrates that photomet- 
ric stellar masses in general are slightly overestimated. In 
order to compare how the properties of a mass-limited 
sample would change, we use a mass-cut of > 10 11 M© 
(-Mgpec for the "spectroscopic" sample and M p hot for the 
"photometric" sample) in addition to the redshift in- 
terval for the remaining properties. The second panel 
demonstrates that spectroscopically derived rest-frame 
U — V colors of a mass-limited sample at 2.0 < z < 2.7 
are slightly redder (~0.1 mag) than inferred from just 
the photometry. Furthermore, the distribution is peaked 
around U — V ~ 0.75, possibly indicating a red sequence 
(M. Kriek et al. 2007 in preparation). Also, the median 
J—K color as inferred from the photometry is ~ 0.1 mag 
too blue, while the median R — K is ~ 0.15 mag too red. 
This may reflect the fact that dusty galaxies (see § 13.4(1 
with relatively red R — K and blue J — K colors fall 
out of the sample, and move to lower redshift. Nonethe- 
less, the photometric and spectroscopic distributions are 
remarkably similar for both J — K and R — K. 

The systematics in both the photometric mass and red- 
shift distribution introduce a systematic in number of 
massive galaxies in a certain redshift interval, as shown in 
Figure [TBj As both the photometric masses and redshifts 
at z ~ 2.5 are generally overestimated, the number of 
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massive galaxies and the mass density will be lower than 
inferred from the photometry. Our sample is too small 
to directly quantify this effect. However, we can use the 
originally photometric sample to estimate the change of 
the number of massive galaxies in certain redshift bins. 

We construct a if-selected (K < 21.3) photomet- 
ric galaxy sample from the deep MUSYC catalog, and 
correct this sample using the coupled distributions of 
(^phot - z spcc )/(l + z spcc ) and M spec /Mphot- For these 
distributions we solely use the GNIRS galaxies in the 
deep fields (excluding ECDFS galaxies). As galaxies 
will scatter in and out of the relevant redshift interval 
(2 < z < 3) from both low and high redshift, we start 
with a photometric sample over a larger redshift interval 
(1.5 < z < 4.0). We randomly apply the coupled distri- 
butions to the original sample, and repeat this process 
1000 times. From the original sample and all 1000 simu- 
lations we construct a mass-limited sample (> 10 11 M Q ). 
We apply the mass-cut after correcting, as galaxies that 
do not make the cut in the original sample can still 
scatter into the corrected sample. The depth of the de- 
selected catalog allow s extraction of an almost complete 
mass-limited sample (jvan Dokkum et ai1 f2006). Finally, 
we divide the sample in redshift bins of Az =0.1 and 
average the simulation to construct the final corrected 
number distribution. Figure [17] presents both the origi- 
nal and corrected number distributions of a mass-limited 
sample as a function of redshift. 

The difference in the original and corrected distribu- 
tion in Figure [IT] shows that photometric studies may 
overestimate the number of massive galaxies by a factor 
of ^1.3 at 2 < z < 3. However, this test is a simpli- 
fication. First, we used the same coupled distributions 
of (zphot - z S pec)/(l + ^spec) and M spec /M p hot randomly 
for all galaxies, while we know that there are correla- 
tions between Az/(1 -I- z) and S/N of the photometry, 
K magnitude, SED type etc. (see § 13. 4j) . Our spectro- 
scopic sample is too small to quantify these correlations 
in an applicable form. Second, our sample and thus the 
distributions of Az/(1 + z) and Af spoc /Af p hot may be in- 
complete. For example, it may be that we miss galaxies 
with a z p hot~ 4 and a z spcc ~ 2. A larger sample over a 
larger redshift range in needed to better understand the 
systematics. 

4.3. Implications for Previous Studies 

At this moment most studies of massive galaxies be- 
yond z > 1.5 rely on photometric redshifts. Thus, the 
systematics found in this work will have consequences for 
previous studies, especially to those that use the same or 
a similar method to derive photometric redshifts. In this 
section we will discuss several studies and how our find- 
ings may affect their results. 

Using a photometric mass-lim ited sample (M > 
10 n M ? ) lvan Dokkum et all (|2006l ) recently showed that 
massive galaxies at 2 < z < 3 have red rest-frame opti- 
cal colors, and found a median J — K and rest-frame 
U — V color of 2.48 and 0.62 respectively. This study 
also contained the deep MUSYC fields (in addition to 
several other deep fields) and used the same photomet- 
ric redshift code as used in this paper. In the previous 
section we showed that the distributions of J — K and 
R — K barely changes when including the spectroscopic 
information. The rest-frame U — V color is slightly red- 
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z 

Fig. 17. — The original and corrected number of galaxies with 
Al > 10 U M Q vs. redshift in the deep MUSYC fields. The original 
distribution is based on z p hot and photometric masses. The cor- 
rected distribution is estimated using the coupled distributions of 
(-Zphot - 2spec)/(l + 2spcc) and M spe c/M pllc , t , as explained in the 
text. This figure illustrates that the number of massive galaxies 
with M > 10 11 Mq i n the range 2 < z < 3 is likely lower than 
previously derived by van Dokkum et al. (2006) using only photo- 
metric information. 

der and more peaked, but this difference is consistent 
with the photometric errors. Thus, our spectroscopic 
study does not change, bu t supp orts the overall conclu- 
sion bv lvan Dokkum et al.l (|2006f ) that the high mass end 
at 2 < z < 3 is dominated by red galaxies. 

Although the colors of massive galaxies at 2 < z < 3 
barely change when including spectroscopic information, 
the number of massive galaxies decreases (see § !4.2p . This 
will be of importance for our un derstanding of the evo- 
lution of the mass d ensity fe.g.. | Dickinson et al.l 120031 : 
iRudnick et all 120031 . 120061 : iDrorv et all l2005h . Due to 
the numerous low redshift spectroscopic surveys, photo- 
metric redshifts below z = 1 are well calibrated and have 
small uncertainties in the order of Az/ (l+z)~ 0.03. This 
results in well-calibrated masses. However, as shown in 
this paper, photometric redshifts beyond z > 1.5 are gen- 
erally poorly calibrated and may show strong systemat- 
ics. These systematics may result in too high redshifts, 
and thus too high stellar masses. For the photometric 
redshift code and template set used in this paper, the evo- 
lution of the global stellar mass density between z ~ 2.5 
and z ~ would have been underestimated by a factor 
of ^1.3, due to the combination of redshift and mass sys- 
tematics. However, this factor is almost entirely driven 
by overestimates of photometric redshifts. As other stud- 
ies may use different z p hot codes and templates sets, they 
may not suffer from this effect. 

Our resul ts will also affect g alaxy clustering stud- 
ies (e.g., iDaddi et alJ 120031 : iQuadri et all l2007at 
iFoucaud et al.ll2007h . Galaxy clustering at high redshift 
is measured using the angular two-point correlation 
function. The angular correlation function can be 
converted into the spatial correlation function using the 
Limber equation. The main ingredient in this equation 
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is the redshift distribution. iQuadri et alj (|2007af ) use 
the summed redshift probability distribution, in order 
to account for different z p hot errors of the individual 
galaxies. In the two bottom panels of Figure [10] 
we compare the summed probability distributions of 
the photometric redshifts of the GNIRS galaxies for 
both the full and deep sample, with the spectroscopic 
distribution. For the continuum redshifts we use the 
probability distribution as well. Although the median 
of the distribution for both samples is slightly offset in 
redshift, the width of the distribution is similar. Because 
the Limber deprojection is more dependent on changes 
to the width of the distribution than the central value, 
the clustering results will not change significantly when 
using spectroscopic redshifts. One can argue that this 
might be coincidence and further study is necessary to 
confirm this. 

5. SUMMARY 

In this paper we present our complete NIR spectro- 
scopic survey for if -selected galaxies at z ~ 2.3. We ac- 
quired high quality NIR spectra (1-2.5 //m) with GNIRS 
for a total sample of 36 galaxies. The galaxies were se- 
lected to be bright in K (<19.7) and have photomet- 
ric redshifts in the range 2.0 < z < 3.0. All galaxies 
have photometric stellar masses > 10 M®. The distri- 
bution of observed J — K, R — K and photometric rest- 
frame U — V colors are similar as those of a photometric 
mass-limited sample extracted from the deep MUSYC 
fields. This suggests that our spectroscopic sample is 
representative for a mass-limited (> 10 11 M Q ) sample at 
2 < Zphot < 3. Our sample is not representative for a 
if-bright galaxy sample, as our redshift distribution is 
slightly higher. Nevertheless, the distribution of photo- 
metric rest-frame U — V colors is similar for the GNIRS 
sample and the full if-bright photometric sample. 

We successfully derived spectroscopic redshifts for all 
galaxies. For 19 of the galaxies we detected rest-frame 
optical emission lines, which provided us with accurate 
redshift measurements. For the remaining galaxies we 
derived redshifts by modeling the continuum spectra in 
combination with the optical photometry. We tested this 
method using the emission-line galaxies, and found a 
scatter of Az/(1 + z) = 0.019, and no significant sys- 
tematic offset. The continuum redshifts are more accu- 
rate for optically red galaxies, with large Balmer/4000 A 
breaks, and for redshifts for which the break falls in an 
atmospheric window. 

Comparison with spectroscopic redshifts shows that 
the original photometric redshifts are generally overes- 
timated by Az/(1 + z) = 0.08, and have a scatter of 
Az/(1 + z) — 0.13. Both the systematic and the scat- 
ter are worse for galaxies with shallow NIR photometry 
(K ~ 20). For the galaxies with deeper NIR photometry 
(K ~ 21) we find a systematic offset of Az/(l + z) = 0.03 
and a scatter of Az/(1 + z) = 0.08. 

In addition to shallow NIR photometry, the lack of well 
calibrated galaxy templates is another important cause 
for the large systematics and scatter of photometric red- 
shifts. Especially, dusty and/or young galaxies have high 
Az/(1 + z). This may not be surprising as the origi- 
nal template set of our photometric redshift code lacked 
these galaxies. We almost completely remove the system- 
atic offset when repeating the fitting procedure replacing 



the original templates by 6 SEDs chosen from fits to the 
spectroscopic sample. We examine other possible causes 
for the scatter and systematics in Az/(1 + z) as well. 
Emission line contamination is only of minor importance 
for this sample and may affect the z p hot for some individ- 
ual galaxies. Furthermore, we find a correlation with K- 
band magnitude, such that if-bright galaxies in general 
have larger Az/(1 + z) than those faint in K. This effect 
may be diminished by using a luminosity prior in the pho- 
tometric redshift code. Although improved photometric 
redshift codes and better templates may remove the sys- 
tematics and decrease the scatter, photometric redshifts 
remain limited by the low resolution spectral shape, and 
are not likely to surpass a scatter of Az/ (1 + z)= 0.05 at 
the targeted redshift range - even for the galaxies with 
deep NIR photometry. We note that obtaining photom- 
etry in narrower bands and for a larger number of filters 
may lead to better results than has typically been the 
case to date. 

The spectroscopic stellar population properties, as de- 
rived by modeling the spectral continuum shape together 
with optical photometry, allow us to examine the signif- 
icance and reliability of the stellar population properties 
derived from the photometry alone. Strikingly, the prop- 
erties age, SFR, t show a scatter of a factor of~ 3 — 4 be- 
tween the photometrically and spectroscopically derived 
properties. For Ay this scatter is ~ 1 mag. The im- 
provement in age and r is driven primarily by the higher 
resolution spectral shape, and less so by the better con- 
strained redshift. For SFR and Ay both effects are about 
equally important. The uncertainties in the photometric 
modeling seem reasonable, as for ~ 3/4 of the galaxies 
the photometric and spectroscopic properties are consis- 
tent within lcr. Nevertheless, the photometric and spec- 
troscopic distributions are quite different, accounting for 
their errors. This may be caused by the fact that age, Ay 
and t suffer from systematics, due to degeneracies with 
the photometric redshift. For example, the photometric 
stellar population properties for several galaxies converge 
to the properties of best-fit templates, used in the pho- 
tometric redshift code. This all implies that the best-fit 
photometric properties and especially their distribution 
are strongly biased. 

The photometrically derived stellar mass and mass-to- 
light (M/Ly) ratio are best determined and show a scat- 
ter of a factor of ~ 2 with their spectroscopic analogues. 
However, the systematic offset in Az/(1 + z) results in 
a systematic offset in -Mphot / '-Mspecj such that Mphot is 
about 30% too high for the galaxies with deep NIR pho- 
tometry. This systematic is canceled out in M/Ly, as the 
Ly iP hot suffers from similar systematics as M p hot- The 
improvement for both stellar mass and M/Ly is about 
equally driven by the higher resolution spectral shape 
and the better constrained redshift. 

Rest-frame V magnitudes and U — V colors are the 
derived properties which are most sensitive to photo- 
metric redshift uncertainties and low S/N photometry. 
The scatter between the photometrically and spectro- 
scopically derived properties is 0.4 and 0.2 mag for V 
and U — V respectively for galaxies with deep NIR pho- 
tometry, and both increase by 50% when we include the 
galaxies with shallow photometry. The systematics in 
the photometric redshifts cause a systematic offset of - 
0.2 and -0.1 mag for V and U — V for galaxies with deep 
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NIR photometry. Thus, for both the scatter and the sys- 
tematics, the improvement is almost completely driven 
by better constrained redshifts. 

In order to assess the impact of the identified systemat- 
ics on complete photometric samples, we compared pho- 
tometric and spectroscopic samples in the same mass and 
rcdshift range. Surprisingly, we found that the distribu- 
tion of J — K and R — K color barely changes for mas- 
sive galaxies (> lO n M ) at 2.0 < z < 2.7. Thus, the 
spectroscopy supports previous studies that red galax- 
ies dominate the high mass end at z <~ 2.5. However, 
the combination of photometric redshift and stellar mass 
systematics also affect the number of massive galaxies. 
The photometric rcdshift code and template set used in 
this work overestimates the number of massive galaxies 
by factor of^l.3at2<z<3, resulting in an underes- 
timation of the stellar mass density between z ~ 2 and 
z ~ 0. 

Overall, our spectroscopic survey demonstrates the 
large uncertainties of best-fit photometric properties, 
and the necessity for accurate calibration of photometric 
studies. Although this study is a step forward in under- 



standing the systematics in photometric studies of mas- 
sive galaxies at z > 2, larger spectroscopic samples over 
a larger redshift range are needed to fully map the sys- 
tematics and accurately calibrate photometric studies. 
Furthermore, this study only applies to the high-mass 
end of the high-redshift galaxy population. Less mas- 
sive galaxies may have other systematics, although that 
remains to be explored. 
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